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NOVEL HUMAN ESTROGEN RECEPT0R-i3 
Field of the Invention 

This invention pertains to DNA encoding a novel human estrogen receptor-^ 
5 (hER^), hER^ polypeptides, and methods for expressing and isolating hER/J. The invention 
also pertains to methods for using hER(3 to identify coactivators and inhibitors as well as 
tissue-specific estrogens. 

Background of the Invention 

10 The physiological response to steroid hormones is mediated by specific 

interactions of steroids with nuclear receptors, which are ligand-activated transcription factors 
that regulate the expression of target genes by binding to specific DNA response elements. 
These receptors comprise (in an aminoterminal-to-carboxy terminal direction) a hypervariable 
aminoterminal domain that contributes to the transactivation function; a highly conserved 

15 DNA-binding domain responsible for receptor dimerization and specific DNA binding; and 
a carboxyterminal domain involved in ligand-binding, nuclear localization, and ligand- 
dependent transactivation. 

Recently, cDNA was cloned from rat prostate and was shown to have 
significant homology to a previously isolated rat estrogen receptor cDNA. Kuiper et al., 

20 Proc.NatLAcad.Sci.USA 93:5925, 1996. This receptor was designated ER/? to distinguish 
it from a previously cloned receptor, ERa. Rat ER^S was shown to be expressed in the 




prostate, testes, ovary, and thymus, in contrast to ERa, which is most highly expressed in 
the uterus, breast, liver, and pituitary. 

A human ER(3 homologue having the aminoterminal sequence Gly-Tyr-Ser has 
been reported. Mosselman et al., FEBS Letts. 392:49, 1996, This reported sequence lacks 
5 an initiator methionine, however; therefore, the complete aminoterminal sequence could not 
be determined. Thus, the full-length human gene remained unknown and an accurate picture 
of the molecular determinants of the transactivation function of authentic hER/3 could not be 
obtained. 

10 Summary of the Invention 

The present invention provides nucleic acids encoding a full-length human 
estrogen receptor-jS (hERjSL). The nucleic acid sequence of hER/3L, which is depicted in 
Figure 3, SEQ ID N0:1, encodes a receptor having the amino acid sequence depicted in 
Figure 4, SEQ ID N0:2. hER/3L according to the present invention contains 45 amino acids 

15 at its aminoterminus which were not previously known. These amino acids are believed to 
contribute to the transcription activation function of the receptor. 

hERjSL is selectively expressed in the thymus, spleen, ovary, and testes. 
Accordingly, hER^ can be used to identify co-activator proteins that are involved in estrogen- 
regulated gene expression, as well to identify tissue-selective estrogens. 

2 0 The present invention provides isolated polypeptides having the sequence of 

SEQ ID NO: 2 and function-conservative variants thereof which exhibit estrogen-regulated 
transcriptional activation activity. In a related aspect, the invention encompasses isolated 
peptides derived from hER/? comprising a sequence corresponding to amino acids 1-45 of 




SEQ ID N0;2 and function-conservative variants thereof, as explained above. It is believed 
that this sequence provides at least part of the transactivation function. 

The present invention also provides isolated nucleic acids encoding hER/3L and 
hEI^L-derived peptides, including the nucleic acid sequence depicted in Figure 3, SEQ ID 
5 N0:1 and subfragments thereof encoding peptides which comprise amino acids 1-45, as well 
as sequence-conservative and function-conservative variants thereof. Also encompassed by 
the invention are DNA vectors comprising an hERjSL-encoding sequence operably linked to 
a transcription regulatory element and cells comprising these vectors. Methods for producing 
hER/SL-derived polypeptides include incubating a cell comprising an hER/3L-encoding 
10 expression vector under conditions that permit expression of one or more hER/S polypeptides. 
The methods further include: (a) harvesting the cells to produce a cell fraction and a medium 
fraction; and (b) recovering the polypeptide(s) from the cell fraction, medium fraction, or 
both. 

In another aspect, the invention provides methods for identifying hER^S- 
15 interactive compounds, including agonists, antagonists, and co-activator proteins. In one 
embodiment, the method includes: 

(a) contaciing purified hER/3 with a labeled ligand in the presence of test 
compounds, to form test reactions, and in the absence of test compounds, to form control 
reactions; 

2 0 (b) incubating the test and control reactions under appropriate conditions to 

achieve specific binding of the labelled ligand to hER/?; 

(c) determining the level of binding of the labeled ligand to hER(3 in said test 
and control cultures; and 



(d) identifying as a hERiS-interactive compound any compound that reduces 
the binding of the labeled ligand to hER^. 



Brief Description of the Drawings 

^ I [ ) 5 Figure 1 is a schematic illustration of the oligonucleotides used for PGR 

S [ i 

r^)"^ amplification of human estrogen receptor-/? (hER/SiJ cDNA. 

Figure 2 is a schematic illustration of the pcDNA3 plasmid containing hERjSL 

cDNA. 

Figure 3 is an illustration of the full-length cDNA sequence encoding human 
10 estrogen receptor-i3 (SEQ ID N0:1). 

Figure 4 is an illustration of the predicted amino acid sequence of the hER/3 
polypeptide (SEQ ID N0:2). The first 45 (previously unknown) amino acids are underlined. 

Figure 5 is a photographic illustration of an autoradiogram of a 10% SDS- 
polyacrylamide gel in which hERjS in vitro translation products are resolved. Lane 1, hER/3y; 
15 lane 2, hER/J^ produced from a vector encoding a synthetic translation initiation site; lane 3, 
hER/3L produced from a vector encoding the natural hER/3 translation initiation sequences. 

Figure 6A is a graphic illustration of the transcriptional activation capacity of 
full-length hER/? (hERiSJ and truncated hER/3 (hER/Sx) expressed in HepG2 cells. Cells were 
transfected with either hER/^L or hERiSj and co-transfected with a luciferase reporter plasmid 
2 0 containing an estrogen response element (ERE) (ERE.TK. LUC) and a control /3-galactosidase 
plasmid. Cells were incubated in the absence or presence of estradiol, after which luciferase 
activity w^as measured and normalized to /3-galactosidase activity. Figure 6B is a graphic 




illustration of luciferase activity in HepG2 cells transfected with either hER/^L or hERiSj and 
co-transfected with a luciferase reporter plasmid lacking an ERE (TK.LUC) 

Figure 7A is a graphic illustration of the effect of estradiol stimulation of full- 
length hER/3 (hERi^L) and truncated hER/S (hERjSx) on NFkB activation in HepG2 cells. Cells 
5 were transfected with either hERjSL or hERjSy and co-transfected with a luciferase reporter 
plasmid containing three copies of an NFkB binding site (3X-NFkB TK.LUC) and a control 
/3-galactosidase plasmid. Cells were stimulated with interleukin-1/3 and incubated in the 
absence or presence of estradiol, after which luciferase activity was measured and normalized 
to i3-galactosidase activity. Figure 7B is a graphic illustration of luciferase activity in cells 

10 transfected with either hER/?L or hERjSj and co-transfected with a luciferase reporter plasmid 
lacking an NFkB binding site (TK.LUC), 

Figure 8 is a graphic illustration of the transcriptional activation capacity of 
full-length hERiS (hER/SJ and truncated hERi3 (hER/3T) expressed in HAECT-1 human 
endothelial cells. Cells were transfected with either hER/SL or hER/Sj and co-transfected with 

15 a luciferase reporter plasmid containing an ERE (ERE. TK.LUC) or one lacking an ERE 
(TK.LUC). Cells were incubated in the absence or presence of estradiol, after which 
luciferase activity was measured. ERE TK.LUC values were normalized to TK.LUC values 
and are presented as mean jf S.E. (n=4). 

Figure 9 is a graphic illustration of the effect of increasing doses of estrogens 

2 0 (17-iS estradiol or genistein) on the transcriptional activation capacity of full-length hER/3 
(hER/^L) and truncated hER(3 (hER/Sj) expressed in 5. cerevisiae. Cells were transformed 
with either hERj^L or hER/Jj and co-transformed with a jS-galactosidase reporter plasmid 




containing an ERE, Transformed cells were treated with estrogens for 3h and assayed for 
jS-galactosidase activity. 

Detailed Description of the Invention 

5 Human estrogen receptor-jS (hereinafter, hER/3) comprises aminoterminal amino 

acid residues not previously known. The present invention encompasses isolated, purified, 
nucleic acids encoding authentic full-length hER/J of 530 amino acid residues and fragments 
thereof which include nucleic acids encoding amino acids 1-45 of hER(3. The invention also 
encompasses isolated, purified, polypeptides comprising hER/3 and peptides derived 
10 therefrom, particularly peptides which include residues 1-45 of hERj3. The invention also 
provides expression systems in which transcriptionally active hER/3 or fragments derived 
therefrom can be produced, as well as screening methods for identifying hER/3 agonists and 
antagonists (including tissue-specific estrogens and anti-estrogens) as well as hER/3 co- 
activators and inhibitors. 

15 

Isolation and Characterization of the Gene Encoding liERB 

The present inventors have isolated the cDNA encoding hER/3 using the 
methods outlined below. Human testis Poly A+ RNA (1 /xg, Clontech, Palo Alto CA) was 
mixed with 0.5 ^g oligo dT primer (GIBCO-BRL, Gaithersburg MD) in a total volume of 10 
2 0 /xl. The mixture was heated at 70°C for 10 minutes, and, after cooling on ice, was 
supplemented with 500 of each deoxynucleoside triphosphate, IX cDNA synthesis buffer, 
and 10 mM DTT to a final reaction volume of 20 jx\. The mixture was incubated at 42°C 
for 2.5 minutes and then supplemented with 1-2 units reverse transcriptase (GIBCO-BRL, 
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Gaithersburg MD), after which it was incubated at 45 °C for 30 minutes and SC'C for 5 
minutes. One-tenth of this mixture (approximately 2 /xl) containing the cDNA template was 
then used in PGR amplification of hER/S using forward and reverse primers as described 
below. 

\ 

5 Alignment \ of the known rat ER/S sequence (Kuiper et aL, 

\ 

Proc. Natl. Acad. Sci. USA 9^^5925, 1996) with that of a human homologue (Mosselman et al., 
\-' FEBS Letts. 392:49, 1996) sAjggested that the human sequence lacked at least the ultimate and 

* penultimate residues at its aminoterminus, as shown below: 

Rat: N^TFYSPAVMNYS . . . 

10 Human: - -IgYSPAVMNYS . . . 

Based on this information, PGR ij)rimers were designed that supplement the human sequence 
with the two missing aminotermin^l residues M and T and with an artificial Kozak translation 
initiation sequence. The forward primer, having the sequence 
5'-GGAAGGTTGTGGAGGATGAT^GGGGGTATAGGGGTGGTGTGATG-3' 
15 and a reverse primer, having the sequeJ^ce 

5'-GGATGTAGAGTGGAGGGGTGAGTXiAGAGTGAGGGTTGTGG-3^ 
were used to amplify hER^ sequences in i reaction containing the following components: 
2 /xl of the cDNA template described above; IX PGR buffer; 200 of each 
deoxynucleoside triphosphate, 2 units of hot tab polymerase (Amersham, Arlington Heights 
2 0 IL), and l^g of each of the forward and reverse primers. The reaction mixture was heated 
to 95°G for 2 minutes, annealed at 52°G for 1 minute, and amplified using 36 cycles of 72°G 
for 1.5 minutes. 



A fragment of approximately 1500 bp in length was produced. The fragment 
digested with HindlU and Xbal (which cleave at sites present in the forward and reverse 
primer sequences, respectively, but not in the main body of the amplified cDNA sequence) 
and cloned into the corresponding sitps of the pcDNA3 expression vector (Invitrogen, 
Carlsbad CA). This assymetric cloning strategy places the 5' end of hER/3 cDNA under the 
control of the viral CMV promoter in pcpNA3 (Figures 1 and 2). Several insert-containing 

pcDNA3 clones were identified. Plasm^d DNA was prepared from three clones using a 

I 

plasmid purification kit (Qiagen, Santai Clarita CA) and their insert sequences were 
determined by the dideoxy termination metnod. One clone (designated R61010-2.24 or Clone 
3) was found to contain an insert with a nucleotide sequence identical to the published hER(3 

sequence (Mosselman et al., FEBS Letts }\392A9, 1996) and had the following 5* end 

\ 

structure: ^ 

M T G Y . . . \ 
CCATC ATG ACC GGC TAT ... \ 

\, 

This clone was designated "truncated hER/3" or H^R/Sj. 

To verify the am\ioterminal and upstream sequence of human hER/3, two 
independent approaches were taken, as described below. 

(1) 10 fi\ of a human ovary 5'-Stretch cDNA library (Clontech, Palo Alto CA) 
was mixed with 50 fi\ of IX K solution (IX PCR Buffer (GIBCO-BRL, Gaithersburg MD), 
2.5 mM MgCl2, 0.5% Tween-20, 100 pig/ml Proteinase K), and the reaction mixture was 
incubated at 56°C for 2 hours, then at 99''C for 10 minutes. 5 /xl of this reaction mixture 
were then used as template in a nested PCR reation. For the first round, the forward primer 
(pDR2 sequencing primer, Clontech, Palo Alto CA) had the sequence 5'- 



CTGGTAAGTTTAGTCTTriVGTC-3\ and the reverse primer (hER/S-specific, designated 
oligo #12908) had the sequence 5'-GCTTCACACCAAGGACTCTTTTGAG-3\ The 
reaction contained IX Klentaq PGR reaction buffer (40 mM Tricine-KOH, 15 mM KOAc, 
3.5 mM Mg(0Ac)2, 75 ^g/ml bov'^ne serum albumin); 0.2 mM of each dNTP; 0.2 ^M of 
each of the above primers, and 1 uni\ of Klentaq Polymerase Mix (Clontech, Palo Alto CA). 

Touchdown PGR conditions were as follows: 5 cycles of 94 ""C for 2 seconds and 72°G for 

\ 
\ 

4 minutes, followed by 30 cycles of 9|''G for 2 seconds and 67°C for 3 minutes. 

Excess nucleotides and primers were removed from first round PGR reactions 
by purification over Wizard PGR columns (Promega, Madison WI). A second-round PGR 
reaction was then performed using 2 /xl of the purified first-round reaction mixture. For the 
second round, the forward primer was the pDR2 sequencing primer shown above, and the 
reverse primer had the sequence 5'-GTTGGGtAGAAGAGATTTGGGGTTGT-3^ (hER/S- 
specific, designated oligo #13871). The PGR reaction and cycling conditions were identical 
to those employed in the first rouJad. The products were cloned into pGR2. 1 (Invitrogen) and 
three resulting clones were sequenced. All three clones (designated LI, L2, and L3) 
contained hER/? inserts of different lengths, all of which were homologous to hER/? and to 
each other. 

(2) A Marathon Ready thymu^ cDNA kit (Glontech) for 5' rapid amplification 
of cDNA ends (RAGE) was also used to isolated hER/3 5' clones. In the first round of a 
nested PGR reaction, 5 of human thymus Marathon-ready cDNA (Glontech) was used as 
template. The forward primer had the sequence 5'- 
GGATGGTAATAGGAGTGAGTATAGGGG-3' (Adaptor primer 1 , Glontech), and the reverse 
primer had the sequence 5*-GGTTGAGAGGAAGGAGTGTTTTGAG-3' (hER/3-specific, 



, ^ . designated oligo #12908). The PCR reaction and cycling conditions were identical to those 

\\ described in (1) above. \ 

Excess nucleotides and primers were removed from the first round PCR 
reactions by purification over\^Wizard PCR columns (Promega). A second round PCR 
5 reaction was performed using 2 ^ of the purified first round reaction. For the second round, 
the forward primer had the sequence 5^-ACTCACTATAGGGCTCGAGCGGC-3' (nested 
adaptor primer 2, Clontech), \ and the reverse primer had the sequence 5'- 



\ l^' GTTGGCCACAACACATTTGGG0;TTGT-3' (hERi3-specific, designated oligo#13871). The 

)p \ second round PCR reaction and cyclii^g conditions were identical to those employed in the 

\ 

10 first round. The products were cldned into the pCR2.1 vector and two clones were 
sequenced. The two clones contain inseit sequences of different lengths that are homologous 

to hERj3, to each other, and to the sequeiices isolated from a human ovary cDNA library as 

\ 

described above. \ 

All of the hERiS sequences isolated by methods (1) and (2) above contained 1 10 
15 nucleotides corresponding to hERiSj sequences, as well as 228 additional nucleotides at the 
5* end (Figure 3). 

The hERjS cDNA sequence determined from these clones contained several 
important differences from the previously known human sequence. First, the third amino 
acid of the previous sequence was found to be F and not G (see above). Second, the 
2 0 methionine residue at the aminoterminus of the previous sequence was found not to be the 
initiator (i.e., true aminoterminal) residue. Rather, the authentic full-length hER(3 cDNA 
sequence encodes a polypeptide having 530 residues, the first 45 of which are not found in 
the previously known human sequence (Figure 4). The sequence appears to be quite 




homologous to rat ER/3; however, this reading frame was not identified previously (Kuiper 
et al., Proc, Nat L Acad, Sci, USA 93:5925, 1996). Furthermore, an optimal Kozak translation 
initiation sequence is found upstream of the newly discovered initiator methionine codon. A 
termination codon was identified 63 nucleotides upstream to the authentic ATG initiator codon 
5 in the same reading frame. 

The cDNA encoding authentic full-length hERjS was cloned into pCDNA3 
under the control of the CMV promoter; this expression vector was designated "long hER/?" 
or hER^L- 

10 Synthesis of full4en£th hERQ and truncated hERS 

To examine the natural start site for translation of hER/3, three plasmids were 
subjected to coupled transcription-translation, encoding hERiSx (with a synthetic upstream 
translation initiation sequence), hER/^L (with a synthetic upstream translation initiation 
sequence), and hERj3L containing 93 nucleotides of its native upstream sequence (the entire 

15 sequence shown in Figure 3). The plasmids were transcribed and translated using the TNT 
T7 Coupled Reticulocyte Lysate System (Promega #L4610). Circular plasmid DNA was 
purified using Qiagen Maxi-Kit #12362. 2^g of the DNA was transcribed and translated in 
a single reaction in the presence of [^^S] -methionine (New England Nuclear, Boston MA). 
The translation products were resolved on a 10% SDS polyacrylamide gel and were visualized 

2 0 by autoradiography (Figure 5). 

The resulting translation products of both hER^^ products were of similar size 
(-63 kDa), and the hER/Jj product was appropriately shorter (-56 KDa). This indicates 
that the initiator ATG most likely utilized in vivo is the ATG at position 94-96. Utilization 




of a further upstream ATG is unlikely because of a termination codon in-fram with the 
presumed start site. Confirmation of the authentic start site is achieved by subjecting hER/3 
polypeptides to aminoterminal sequencing. 



5 Functional differences between full-length hER8 and truncated liERQ 

The experiments described below were performed to evaluate the transcription 
activation properties of full-length hER/S according to the present invention and to compare 
it with that of truncated hER/3. hER/^L and hER/?-r were expressed in parallel in different cell 
types and tested for their ability to transactivate reporter genes containing estrogen response 
10 elements (EREs). Alternatively, hERjSL and hER/S^ may be expressed in host cells containing 
endogenous estrogen-responsive genes and the estrogen-mediated activation of the endogenous 
genes is measured. 

(i) HepG2 Cells : 

HepG2 cells (ATCC) were transfected in parallel with either pcDNA3-hER/3L 
15 or pcDNA3-hERi3y using the calcium phosphate co-precipitation method. Cells were co- 
transfected with a reporter plasmid containing a luciferase gene preceded by either an ERE 
upstream of the thymidine kinase (TK) basal promoter, or the TK basal promoter alone. 
Cells also received a plasmid encoding /3-galactosidase under the control of an RSV promoter, 
which was used to correct for variation in DNA uptake. Five hours after transfection, cells 
2 0 were incubated with or without lO'^M 17-/3 estradiol for 20 hours, after which cell extracts 
were prepared. Luciferase activity was measured by a chemiluminescent method using the 
Promega luciferase assay system, and /S-galactosidase activity was measured by Galactolight 




(Tropix, Inc., Bedford MA); luciferase activity was then normalized to /J-galactosidase 
activity. 

The results shown in Figure 6A indicate that, in the presence of estradiol, 
hER/3-r caused a 2-fold stimulation of ERE activity. By contrast, hER/^L under the same 
5 conditions caused a 6-fold stimulation of ERE activity. Thus, hER^SL is about 3-fold more 
active than hER/3y in this circumstance. 

In a separate experiment, HepG2 cells were transfected with hER/3L or hER/3y 
as above, but the reporter gene consisted of three copies of an NFkB binding site upstream 
of the TK basal promoter. Transfected cells were incubated with or without interleukin-l/S 

10 (IL-ljS) to activate NFkB and/or with estradiol prior to luciferase determination. The results 
shown in Figure 7 indicate that hER/3L was capable of attenuating the IL-l/S-mediated NFkB 
transcriptional activation (to an extent similar to that observed with hERo:), while hER/3j 
exhibited no inhibitory activity. 

(ii) Human endothelial cells : 

15 HAECT-1 cells (a clonal immortalized human aortic endothelia cell line 

derived by infection with Ad5 ori-SV40 ts A209) were transfected with pcDNAShER/Sy or 
pcDNA3-hER/3L and ERE-luciferase plasmids by elcctroporation. After 4 hours, the cells 
were treated overnight with or without 100 nM 17-/3 estradiol prior to luciferase activity 
measurements. The results shown in Figure 8 indicate that hER/SL is 2-3 times more active 

2 0 than hERjSj in activating the ERE-reporter gene in the presence of estradiol. In independent 
experiments, cells transfected under identical conditions were monitored for their levels of 
estrogen receptors using a ligand binding assay. The results indicate that the increased 
activity of hER/^L relative to hER/J^ is not due to an increase in receptor number or stability, 




and, further, that the 2-3 fold increment measured in the above experiment may be an 
underestimate of the true transactivational capacity of hER/?. 
(iii) Yeast : 

S. cerevisiae strain BJ2168 (Yeast Genetic Stock Center, Berkeley CA) was 
5 co-transformed with an ERE-LacZ reporter plasmid (designated YRpE2) and yeast vectors 
expressing either hER/5L or hER/J^ under the control of the yeast triose phosphate isomerase 
promoter in the yeast pYX242 vector (Rc&D Systems, Minneapolis MN). Transformed cells 
were grown in selective medium for 24 hours, after which they were treated in the presence 
or absence of increasing concentrations of either estradiol or the phytoestrogen Genistein 
10 (Research Biochemical International, Natick MA) for 3 hours prior to determination of jS- 
galactosidase activity. The dose-response results shown in Figure 9 indicate that the maximal 
level of estrogen-stimulated LacZ expression was 2-fold higher in hER/SL-transformed cells 
relative to hER^S^-transformed cells. 



15 DNA, Vectors, and Expression Systems 

Many conventional techniques in molecular biology, microbiology, and 
recombinant DNA, are used in practicing the present invention. See, for example, Sambrook 
et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York; DNA Cloning: A Practical Approach, 

20 Volumes I and II, 1985 (D.N. Glover ed.); Oligonucleotide Synthesis, 1984, (M.L. Gait ed.); 
Nucleic Acid Hybridization, 1985, (Hames andHiggins); Transcription and Translation, 1984 
(Hames and Higgins eds.); Animal Cell Culture, 1986 (R.I. Freshney ed.); Immobilized Cells 
and Enzymes, 1986 (IRL Press); Perbal, 1984, A Practical Guide to Molecular Cloning', the 




series, Methods in Enzymology (Academic Press, Inc.); Gene Transfer Vectors for Mammalian 
Cells, 1987 (J. H. Miller and M. P. Calos eds., Cold Spring Harbor Laboratory); and 
Methods in Enzymology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., 
respectively). 

5 The present invention encompasses purified, isolated, nucleic acid sequences 

encoding hERjS, including, e.g., the nucleotide sequence depicted in Figure 3 SEQ ID N0:1 
and subfragments derived therefrom, including without limitation transcriptional activation- 
competent fragments. An "isolated" or "purified" nucleic acid is a nucleic acid or 
polypeptide that is removed from its original environment (for example, its natural 

10 environment if it is naturally occurring). An isolated nucleic acid or polypeptide contains less 
than about 50%, preferably less than about 75%, and most preferably less than about 90%, 
of the cellular components with which it was originally associated. 

A nucleic acid that is "derived from" an hER/3 sequence is a nucleic acid 
sequence that corresponds to a region of the sequence, sequences that are homologous or 

15 complementary to the sequence, and "sequence-conservative variants" and "function- 
conservative variants". Sequence-conservative variants are those in which a change of one 
or more nucleotides in a given codon position results in no alteration in the amino acid 
encoded at that position. Function-conservative variants are those in which the amino acid 
sequence of hERj3 has been changed without altering the overall conformation and 

20 transcriptional activation function of the hERjS polypeptide, including, but not limited to, 
replacement of an amino acid with one having similar physico-chemical properties (such as, 
for example, acidic, basic, hydrophobic, and the like). A large number of candidate function- 
conservative hERI3 variants, as w^ell as fragments of hER/3 that retain transcriptional 
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activation activity, can be prepared using routine recombinant DNA manipulations as well as 
random or site-directed mutagenesis techniques. Furthermore, hER/S-derived variants or 
fragments that exhibit transcriptional activation activity can be identified using routine 
experimentation by employing the methods described herein, e.g., by co-expression with an 
5 appropriate reporter gene followed by measurement of reporter gene transcription in the 
presence and absence of an estrogen. 

In another embodiment, the present invention encompasses isolated, purified, 
nucleic acids comprising nucleotides 94-229 of the sequence depicted in Figure 3, SEQ ID 
N0;1, which encode amino acids 1-45 of hERj3, and sequence-conservative variants thereof. 

10 The nucleic acids of the present invention include purine- and pyrimidine- 

containing polymers of any length, either polyribonucleotides or polydeoxyribonucleotides or 
mixed polyribo-polydeoxyribo nucleotides. This includes single- and double-stranded 
molecules, i.e., DNA-DNA, DNA-RNA and RNA-RNA hybrids, as well as "protein nucleic 
acids" (PNA) formed by conjugating bases to an amino acid backbone. This also includes 

15 nucleic acids containing modified bases. 

The nucleic acids may be isolated directly from cells. Alternatively, PCR can 
be used to produce the nucleic acids of the invention, using either chemically synthesized 
strands or genomic material as templates. Primers used for PCR can be synthesized using 
the sequence information provided herein and can further be designed to introduce appropriate 

2 0 new restriction sites, if desirable, to facilitate incorporation into a given vector for 
recombinant expression. 

The nucleic acids of the present invention may be flanked by natural regulatory 
sequences, or may be associated with heterologous sequences, including promoters. 




enhancers, response elements, signal sequences, polyadenylation sequences, introns, 5'- and 
3'- noncoding regions, and the like. The nucleic acids may also be modified by many means 
known in the art. Non-limiting examples of such modifications include methylation, "caps", 
substitution of one or more of the naturally occurring nucleotides with an analog, 
5 internucleotide modifications such as, for example, those with uncharged linkages (e.g., 
methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with 
charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.). Nucleic acids may 
contain one or more additional covalently linked moieties, such as, for example, proteins 
(e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., 

10 acridine, psoralen, etc.), chelators (e.g., metals, radioactive metals, iron, oxidative metals, 
etc.), and alkylators. PNAs are also included. The nucleic acid may be derivatized by 
formation of a methyl or ethyl phosphotriester or an alkyl phosphoramidate linkage. 
Furthermore, the nucleic acid sequences of the present invention may also be modified with 
a label capable of providing a detectable signal, either directly or indirectly. Exemplary 

15 labels include radioisotopes, fluorescent molecules, biotin, and the like. 

The invention also provides nucleic acid vectors comprising hER)3-encoding 
sequences or derivatives or fragments thereof. A large number of vectors, including plasmid 
and fungal vectors, have been described for replication and/or expression in a variety of 
eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple 

2 0 cloning or protein expression. The encoded hER/3-derived polypeptides may be expressed 
by using many known vectors, such as pUC plasmids, pET plasmids (Novagen, Inc., 
Madison, WI), or pRSET or pREP (Invitrogen, San Diego, CA), and many appropriate host 
cells, using methods disclosed or cited herein or otherwise known to those skilled in the 
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relevant art. The particular choice of vector/host is not critical to the practice of the 
invention. 



for cloning or expression, one or more markers for selection in the host, e.g. antibiotic 
5 resistance, and one or more expression cassettes. The inserted hER/3-encoding sequences 
may be synthesized by standard methods, isolated from natural sources, or prepared as 
hybrids, etc. Ligation of the hER/S-encoding sequences to transcriptional regulatory elements 
and/or to other amino acid coding sequences may be achieved by known methods. Suitable 
host cells may be transformed/transfected/infected as appropriate by any suitable method 
10 including electroporation, CaCl2 mediated DNA uptake, viral vector-mediated DNA delivery, 
fungal infection, microinjection, microprojectile, or other established methods. 



and plant and animal cells, especially mammalian cells. Of particular interest are E. coli, B. 
Subtilis, Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Schizosaccharomyces 

15 pombi, SF9 cells, C129 cells, 293 cells, Neurospora, and HepG2 cells, CHO cells, COS 
cells, HeLa cells, and immortalized mammalian myeloid and lymphoid cell lines. Preferred 
replication systems include M13, ColEl, SV40, baculovirus, lambda, adenovirus, and the 
like. A large number of transcription initiation and termination regulatory regions have been 
isolated and shown to be effective in the transcription and translation of heterologous proteins 

2 0 in the various hosts. Examples of these regions, methods of isolation, manner of 
manipulation, etc. are known in the art. Under appropriate expression conditions, host cells 
can be used as a source of recombinantly produced hER/3-derived peptides and polypeptides. 



Recombinant cloning vectors will often include one or more replication systems 



Appropriate host cells included bacteria, archebacteria, fungi, especially yeast, 



Advantageously, vectors may also include a transcription regulatory element 
(i.e. , a promoter) operably linked to the hER/3 portion. The promoter may optionally contain 
operator portions and/or ribosome binding sites. Non-limiting examples of bacterial 
promoters compatible with E. coli include: jS-lactamase (penicillinase) promoter; lactose 
5 promoter; tryptophan (trp) promoter; arabinose BAD operon promoter; lambda-derived P, 
promoter and N gene ribosome binding site; and the hybrid tac promoter derived from 
sequences of the trp and lac UV5 promoters. Non-limiting examples of yeast promoters 
include triose phosphate isomerase promoter, 3-phosphoglycerate kinase promoter, 
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter, galactokinase (GALl) 

10 promoter, galactoepimerase promoter, and alcohol dehydrogenase (ADH) promoter. Suitable 
promoters for mammalian cells include without limitation viral promoters such as that from 
Simian Virus 40 (SV40), Rous sarcoma virus (RSV), adenovirus (ADV), and bovine 
papilloma virus (BPV). Mammalian cells may also require terminator sequences and poly 
A addition sequences and enhancer sequences which increase expression may also be 

15 included; sequences which cause amplification of the gene may also be desirable. 
Furthermore, sequences that facilitate secretion of the recombinant product from cells, 
including, but not limited to, bacteria, yeast, and animal cells, such as secretory signal 
sequences and/or prohormone pro region sequences, may also be included. These sequences 
are well described in the art. 

2 0 Nucleic acids encoding hERj3-derived polypeptides may also be introduced into 

cells by recombination events. For example, such a sequence can be introduced into a cell, 
and thereby effect homologous recombination at the site of an endogenous gene or a sequence 
with substantial identity to the gene. Other recombination-based methods such as 



nonhomologous recombinations or deletion of endogenous genes by homologous 
recombination may also be used. 

The nucleic acids of the present invention find use as templates for the 
recombinant production of hERjS-derived peptides or polypeptides. 

5 

hERjS-derived polypeptides 

The present invention encompasses purified hERjS-derived polypeptides 
comprising amino acids 1-45 of hERi3 and further comprising all or part of the amino acid 
sequence depicted in Figure 4, SEQ ID N0:2, and function-conservative variants thereof, 

10 i.e., variants that exhibit estrogen-induced transcriptional activation activity. Also 
encompassed by the invention are peptides comprising amino acids 1-45 of SEQ ID NO: 2 and 
function-conservative variants thereof. 

Nucleic acids comprising hER/3-coding sequences can be used to direct the 
expression of hERjS-derived polypeptides in intact cells or in cell-free translation systems. 

15 The known genetic code, tailored if desired for more efficient expression in a given host 
organism, can be used to synthesize oligonucleotides encoding the desired amino acid 
sequences. The phosphoramidite solid support method of Matteucci et al, 1981, 7. Am, 
Chem. Soc, 103:3185, the method of Yoo et ai, 1989, 7. BioL Chem. or other 

well known methods can be used for such synthesis. The resulting oligonucleotides can be 

2 0 inserted into an appropriate vector and expressed in a compatible host organism. 

The polypeptides of the present invention, including function-conservative 
variants of the disclosed hER/S sequences, may be isolated from wqld-type or mutant human 
cells, or from heterologous organisms or cells (including, but not limited to, bacteria, fungi, 



insect, plant, and mammalian cells) into which an hERj3-derived protein-coding sequence has 
been introduced and expressed. Furthermore, the polypeptides may be part of recombinant 
fusion proteins. 

Polypeptides may be chemically synthesized by commercially available 
5 automated procedures, including, without limitation, exclusive solid phase synthesis, partial 
solid phase methods, fragment condensation or classical solution synthesis. The polypeptides 
are preferably prepared by solid phase peptide synthesis as described by Merrifield, 1963, 
y. Am, Chem, Soc, 85:2149. 

"Isolation" or "purification" of an hER/J-derived polypeptide refers to the 

10 isolation of the polypeptide in a form that allows its transcriptional activation activity to be 
measured without interference by other components of the cell in which the polypeptide is 
expressed. Methods for polypeptide purification are well-known in the art, including, without 
limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, reversed-phase 
HPLC, gel filtration, ion exchange and partition chromatography, and countercurrent 

15 distribution. For some purposes, it is preferable to produce the polypeptide in a recombinant 
system in which the hER/S-derived protein contains an additional sequence tag that facilitates 
purification, such as, but not limited to, a polyhistidine sequence. The polypeptide can then 
be purified from a crude lysate of the host cell by chromatography on an appropriate solid- 
phase matrix. Alternatively, antibodies produced against an hER/3-derived protein or against 

2 0 peptides derived therefrom can be used as purification reagents. Other purification methods 
are possible. 

The present invention also encompasses derivatives and homologues of hER/S- 
encoded polypeptides. For some purposes, nucleic acid sequences encoding the peptides may 
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be altered by substitutions, additions, or deletions that provide for functionally equivalent 

molecules, i.e., function-conservative variants. For example, one or more amino acid 
residues within the sequence can be substituted by another amino acid of similar properties, 
such as, for example, positively charged amino acids (arginine, lysine, and histidine); 
5 negatively charged amino acids (aspartate and glutamate); polar neutral amino acids; and non- 
polar amino acids. 

The isolated polypeptides may be modified by, for example, phosphorylation, 
sulfation, acylation, or other protein modifications. They may also be modified with a label 
capable of providing a detectable signal, either directly or indirectly, including, but not 
10 limited to, radioisotopes and fluorescent compounds. 



hERj3-specific Antibodies 

The present invention encompasses antibodies that specifically recognize hER/3- 
derived peptides and polypeptides, including without limitation antibodies that recognize 

15 hERjS but not, e.g., hERa, and those that recognize hER(3^ but not hER/Sj, Such hERjS- 
specific antibodies can be used conventionally, e.g., as diagnostic reagents or as reagents for 
purification of hER/?-derived polypeptides. Other uses include immunocytochemical 
localization of hER^S; gel shift assays; and "pull-down" experiments to identify protein co- 
activators associated with hER^. 

2 0 hER^-specific antibodies according to the present invention include polyclonal 

and monoclonal antibodies. The antibodies may be elicited in an animal host by 
immunization with hER(3 immunogenic components or may be formed by in vitro 
immunization (sensitization) of immune cells. The immunogenic components used to elicit 
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the production of antibodies may be isolated from any cell source or may be chemically 
synthesized. The antibodies may also be produced in recombinant systems programmed with 
appropriate antibody-encoding DNA. Alternatively, the antibodies may be constructed by 
biochemical reconstitution of purified heavy and light chains. The antibodies include hybrid 
5 antibodies (i.e., containing two sets of heavy chain/light chain combinations, each of which 
recognizes a different antigen), chimeric antibodies (i.e., in which either the heavy chains, 
light chains, or both, are fusion proteins), and univalent antibodies (i.e., comprised of a 
heavy chain/light chain complex bound to the constant region of a second heavy chain). Also 
included are Fab fragments, including Fab' and F(ab)2 fragments of antibodies. Methods for 

10 the production of all of the above types of antibodies and derivatives are well-known in the 
art. For example, techniques for producing and processing polyclonal antisera are disclosed 
in Mayer and Walker, 1987, Immunochemical Methods in Cell and Molecular Biology, 
(Academic Press, London). The general methodology for making monoclonal antibodies by 
hybridomas is well known. Immortal antibody-producing cell lines can be created by cell 

15 fusion, and also by other techniques such as direct transformation of B lymphocytes with 
oncogenic DNA, or transfection with Epstein-Barr virus. See, e.g., Schreier et aL, 1980, 
Hybridoma Techniques, Panels of monoclonal antibodies produced against hER/3 epitopes can 
be screened for various properties; i.e., for isotype, epitope affinity, etc. 



2 0 unlabeled or labeled by standard methods, as the basis for immunoassays. The particular 
label used will depend upon the type of immunoassay used. Examples of labels that can be 
used include but are not limited to radiolabels such as ^^P, ''^I, and "'^C; fiuorescent labels 
such as fluorescein and its derivatives, rhodamine and its derivatives, dansyl and 



Antibodies against hERjS-derived immunogenic components can be used. 



umbelliferone; chemiluminescers such as luciferia and 2,3-dihydrophthal-azinediones; and 
enzymes such as horseradish peroxidase, alkaline phosphatase, lysozyme and glucose-6- 
phosphate dehydrogenase. 

5 Applications 

The methods and compositions of the present invention can be used to identify 
compounds that interact with hER/3, either to activate or to inhibit its transcriptional activation 
function. Such compounds include, without limitation, co-activator proteins, as well as 
estrogens and other steroids, steroid-like molecules, or non-steroid-like molecules that act as 
10 agonists or antagonists. Screening methods can also be used to identify tissue-specific 
estrogens. 

Identification of hER/3-interactive compounds can be achieved by cell-free or 
cell-based assays. In one set of embodiments, purified hER/3 is contacted with a labelled 
ligand, such as, e.g., IV-jS estradiol, in the presence of test compounds to form test reactions, 

15 and in the absence of test compounds to form control reactions. The labelled moiety may 
comprise a radiolabel (such as, e.g., or '^^I) or a fluorescent molecule. Incubation is 
allowed to proceed for a sufficient time and under appropriate conditions to achieve specific 
binding, after which binding of labelled estradiol to hERfi is measured (by monitoring, e.g., 
radioactivity, flurorescence, or fluorescence polarization). In one embodiment, hERjS 

2 0 produced in E. coli (as described in Example 1 below) is adsorbed to the wells of a microtiter 
dish and incubated with [^H]-17/3 estradiol in the absence or presence of test compounds 
(see, e.g., Example 2 below). Alternatively, soluble receptor is incubated with the labelled 
ligand in the absence or presence of test compounds, and bound ligand is separated from free 




ligand. either by filtration on glass fiber filters or by using dextran-coated charcoal. See, 
e.g., Hulme, cd., Receptor-ligand Interactions: A Practical Approach, IRL Press, NY, 1992). 

Whole cell binding assays may also be used in which bound ligand is separated 
from free ligand by rinsing. Cells used in these assays may either contain endogenous 
5 receptor, or may overexpress the receptor subsequent to stable or transient transfection or 
infection of an hER/S gene or cDNA, Non-limiting examples of suitable cells include COS 
cells, Hela cells, CHO cells, human umbilical vein endothelial cells (HUVEC), and yeast. 
Once a compound has been identified as an hER/S-interactive compound by its binding 
activity, further in vivo and in vitro tests may be performed to determine the nature and extent 

10 of activity, i.e., as an agonist or antagonist (see below). 

hERi3-interactive compounds may also be identified using cell-based assays that 
measure transcriptional activation or suppression of endogenous or transfected estrogen- 
responsive genes. For example, agonists (such as, e.g., ITjS-estradiol) block interleukin-ljS 
induction of endogenous E-selectin in primary human umbilical vein endothelial cells 

15 (HUVEC) that express hER/S. Antagonists (such as, e.g., ICI-182780) block the agonist 
activity of 17/8-estradiol. Non-limiting examples of other suitable endogenous estrogen- 
responsive promoter elements include those that regulate endothelin-1 (ET-1); HDL receptor 
(scavenger receptor type II); and enzymes involved in coagulation and fibrinolysis (such as, 
e.g., plasminogen activator inhibitor-1 and complement C3). Any promoter element that 

2 0 responds to estrogen may be used as an appropriate target, including, e.g., the NFkB binding 
site or the apolipoprotein Al gene enhancer sequence. 

In one set of embodiments, appropriate host cells are transfected with an 
expression vector encoding hER(3 and the transfectants are incubated with or without estradiol 



in the presence or absence of test compounds. hER^ activity is assessed by measuring 
transcriptional activation of the target sequence. This may be achieved by detection of 
mRNA (using, e.g., Northern blot analysis) and/or by detection of the protein (using, e.g., 
immunoassays or functional assays). If activation of the target sequence initiates a 
5 biochemical cascade, downstream biological events may also be measured to quantify hER/3 
activity. hERi3-interactive compounds are identified as those that positively or negatively 
influence target sequence activation. 

In another set of embodiments, appropriate host cells (preferably, bacterial or 
yeast cells) are co-transfected with an expression vector encoding hER/? and a reporter 

10 plasmid containing a reporter gene downstream of one or more estrogen response elements 
(EREs). Transfected cells are incubated with or without estradiol in the presence of absence 
of test compounds, after which hER/5 activity is determined by measuring expression of the 
reporter gene. In a preferred embodiment, hER/3 activity is monitored visually. Non-limiting 
examples of suitable reporter genes include luciferase, chloramphenicol acetyl transferase 

15 (CAT), and green fluorescence protein. 

Preferably, the methods of the present invention are adapted to a high- 
throughput screen, allowing a multiplicity of compounds to be tested in a single assay. 
Candidate estrogens and estrogen-like compounds include without limitation 
diethylstilbesterol, genistein, and estrone. Other hER/3-interactive compounds may be found 

2 0 in, for example, natural product libraries, fermentation libraries (encompassing plants and 
microorganisms), combinatorial libraries, compound files, and synthetic compound libraries. 
For example, synthetic compound libraries are commercially available from Maybridge 
Chemical Co. (Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates 



(Merrimack, NH), and Microsource (New Milford, CT). A rare chemical library is available 
from Aldrich Chemical Company, Inc. (Milwaukee, WI), Alternatively, libraries of natural 
compounds in the form of bacterial, fungal, plant and animal extracts are available from, for 
example, Pan Laboratories (Bothell, WA) or MycoSearch (NC), or are readily producible. 
5 Additionally, natural and synthetically produced libraries and compounds are readily modified 
through conventional chemical, physical, and biochemical means (Blondelle et al., TibTech 
14:60, 1996). hER/3 binding assays according to the present invention are advantageous in 
accommodating many different types of solvents and thus allowing the testing of compounds 
from many sources. 

10 Compounds identified as hERjS agonists or antagonists using the methods of 

the present invention may be modified to enhance potency, efficacy, uptake, stability, and 
suitability for use in therapeutic applications, etc. These modifications are achieved and 
tested using methods well-known in the art. 

15 

Description of the Preferred Embodiments 

The following examples are intended to illustrate the present invention without 

limitation. 

2 0 Example 1: High-level Expression of Human hERj3 in E. coli 

Human hER^ according to the present invention is overexpressed in E. coli 
strain BL21(DE3) using, for example, the pET15B vector. A 10-ml overnight culture is used 
to inoculate 1 liter LB medium containing 100 /xg/ml ampicillin. Cultures are grown at 37°C 




and then induced by the addition of 1 mM IPTG. After an additional incubation for 2h at 
25 °C, cells are harvested by centrifugation at 10,000 X G for 30 minutes and resuspended 
in 100 ml of a buffer containing 50 mM Tris-HCl, pH 7.4 -150 mM NaCl. Cells are lysed 
using a French press, and insoluble material is pelleted by centrifugation. The supernatant 
5 solution is recovered and stored at -70 °C. 



Example 2: Estrogen Receptor-j3 Ligand Binding Assay 

For determining the ability of a particular compound to bind hER/3, 100 /xl of 

10 the receptor preparation described in Example 1 above, diluted in assay buffer (Dulbecco's 
phosphate buffered saline (Gibco #14200-075) supplemented with 1 mM EDTA), is added to 
each well of a high-binding masked microtiter plate (Wallac #1450-511, Gaithersburg MD). 
10 /il of test compound (or vehicle) and 10 //I of pH]-17/?-estradiol are added to each well, 
and the plate is incubated at room temperature for 4-6 hours. Unbound material is aspirated, 

15 and the plate is washed three times with 300 ^1 of assay buffer. Then, 150 ixl of scintillation 
cocktail (Optiphase Supermix, Wallac #1200-439) is added per well, and the plate is sealed 
and agitated for at least 5 min. Bound radioactivity is measured by scintillation counting. 

Test compounds are initially tested at a concentration of 1,5 ^g/ml 
(approximately 5 ^M for a compound having a molecular mass of 300). Positive compounds 

2 0 are then re-tested at a number of different concentrations to determine the IC50. 

Data are expressed as percent inhibition of specific binding. Exploratory data 
analysis (EDA) is performed on raw data to check for non-normality and non-homogeneity 
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of variance. The maximum likelihood Box-Cox transformation, which maximizes the 
normality, homogeneity of variance, and goodness of fit of the data, is then obtained. Based 
on the result, the appropriate transformation of the data (no transformation, square root 
transformation, or logarithmic transformation) is used for model fitting. The Huber M- 
5 estimator is used to down weight any outlying transformed observations for analysis of 
variance and dose-response curve fitting. 

For ANOVA, multiple comparisons LSD p-values are computed. Re- 
transformed summary statistics (mean, s.d, s.e.m.) are obtained for each treatment group. 

For dose-response curve fitting, a four parameter logistic model on the 
10 transformed, weighted data are fit. The four parameters are min, max, slope, and ED50, 
where ED50 is defined as the dose which corresponds to midway between the estimated max 
and min. All of the parameters and confidence intervals are re-transformed back to the 
original units of the data. A further transformation into percent inhibition (using estimated 
min and max) is performed. 
15 Using this assay, the following values were obtained for reference compounds: 

IC50 95% confidence limits 

17(3-estradiol 6.7 nM 6 - 7,5 nM 

diethylstilbestrol 21 nM 14 - 31 nM 

genistein 1.6 nM 1.4-1.8 nM 

20 
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All patents, applications, articles, publications, and test methods mentioned 
above are hereby incorporated by reference. 

Many variations of the present invention will suggest themselves to those 
skilled in the art in light of the above detailed description. Such obvious variations are within 
the full intended scope of the appended claims. 



